Qwen 3.6-27B

mentions 1 type Person feed RSS

// recent coverage 1 mentions

04:00

2026-06-16

arxiv.org

large-language-models

Are Online Skill and Memory Modules Always Worth Their Tokens? A Budget-Constrained Study of Web Agents

A new study finds that online web agents augmented with memory, workflow, or skill modules often fail to outperform a token-matched vanilla baseline under a fixed inference budget. Testing across thre…

// co-occurs with top 7 entities

Gemini 3 Flash 1 GPT-5.4-mini 1 WebArena 1 WorkArena-L1 1 AWM 1 ASI 1 ReasoningBank 1